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Abstract 

We study the complexity of constraint satisfaction problems for templates F that are first- 
order definable in (Z; succ), the integers with the successor relation. In the case that T is locally 
CO \ finite (i.e., the Gaifman graph of T has finite degree), we show that T is homomorphically 

equivalent to a structure with a certain majority polymorphism (which we call modular median) 
and the CSP for T can be solved in polynomial time, or F is homomorphically equivalent to 
• a finite transitive structure, or the CSP for F is NP-complete. Assuming a widely believed 

\ conjecture from finite domain constraint satisfaction (we require the tractability conjecture by 

^ ■ Bulatov, Jeavons and Krokhin in the special case of transitive finite templates), this proves that 

Q I those CSPs have a complexity dichotomy, that is, are either in P or NP-complete. 

^ ! 1 Introduction 

. Constraint satisfaction problems appear naturally in many areas of theoretical computer science, 

for example in artificial intelligence, optimization, computer algebra, computational biology, 
computational linguistics, and type systems for programming languages. Such problems are 
typically NP-hard, but sometimes they are polynomial-time tractable. The question as to which 
[ CSPs are in P and which are hard has stimulated a lot of research in the past 10 years. For pointers 

to the literature, there is a recent collection of survey articles [10]. 

The constraint satisfaction problem CSP for a fixed (not necessarily finite) structure P 
^ i with a finite relational signature r is the computational problem to decide whether a given 

^ I primitive positive sentence is true in P. A formula is primitive positive if it is of the form 

3xi, . . . ,Xn- il^i A ■ ■ ■ A ipm where "01 is an atomic formula over P, i.e., a formula of the form 
R{yi, . . . ,yj) for a relation symbol R oi a relation from P. The structure P is also called the 
template of the CSP. 

The class of problems that can be formulated as a CSP for a fixed structure P is very large. It 
can be shown that for every computational problem there is a structure P such that the CSP for 
P is equivalent to this problem under polynomial-time Turing reductions [3]. This makes it very 
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unlikely that we can give good descriptions of all those T where the CSP for T is in P. In contrast, 
the class of CSPs for a finite structure F is quite restricted, and indeed it has been conjectured 
that the CSP for T is either in P or NP-complete in this case [12]. So it appears to be natural to 
study the CSP for classes of infinite structures T that share good properties with finite structures. 

In graph theory and combinatorics, there are two major concepts of finiteness for infinite 
structures. The first is w-categoricity: a countable structure is cj-categorical if and only if 
its automorphism group has for all n only finitely many orbits in its natural action on n- 
tuplcs [9, 19, 16]. This property has been exploited to transfer techniques that were known to 
analyze the computational complexity of CSPs with finite domains to infinite domains [6, 5, 7]; see 
also the introduction of [2]. 

The second concept of finiteness is the property of an infinite graph or structure to be locally 
finite (see Section 8 in [11]). A graph is called locally finite if every vertex is contained in a finite 
number of edges; a relational structure is called locally finite if its Gaifman graph (definition given 
in Section 2) is locally finite. Many conjectures that are open for general infinite graphs become true 
for locally finite graphs, and many results that are difficult become easy for locally finite graphs. 

In this paper, we initiate the study of CSPs with locally finite templates by studying locally 
finite templates that have a first-order definition in (Z; suae), where succ = {{x,y) | x = y + 1} is 
the successor relation on the integers. 

As an example, consider the directed graph with vertex set Z which has an edge between x 
and y if the difference, y — x, between x and y is either 1 or 2. This graph can be viewed as the 
structure (Z; Diffj^ 31,) where Diff 3} = {{x,y) \ x — y E {1,3}}, which has a first-order definition 
over {Z;succ) since -R{i,2}(2^; ?/) iff 

succ{x, y) V Bu, V. succ{x, u) A succ{u, v) A succ{v, y) . 

Another example is the undirected graph (Z; Dist^i 2}) with vertex set Z where two integers x, y 
are linked if the distance, \y — x\, is one or two. 

Structures with a first-order definition in (Z; succ) are particularly well-behaved from a 
model-theoretic perspective: all of those structures are strongly minimal [19, 16], and therefore 
uncountably categorical. Uncountable models of their first-order theory will be saturated; for 
implications of those properties for the study of the CSP, see [4]. In some sense, (Z; succ) constitutes 
one of the simplest infinite structures that is not tA.'-catcgorical. 

The corresponding class of CSPs contains many natural combinatorial problems. For instance, 
the CSP for the structure (Z;i?{]^ 3}) is the computational problem consisting to label the vertices 
of a given directed graph G such that if (x, y) is an arc in G, then the difference between the label 
for x and the label for y is one or three. It will follows from our general results that this problem is 
in P. The CSP for the undirected graph (Z; Dist{i^2}) is exactly the 3-coloring problem, and hence, 
NP-complete. This is readily seen if one observes that any homomorphism of a graph G into the 
template modulo 3 gives rise to a 3-coloring of G. In general, the problems that we study in this 
paper have the flavor of assignment problems where we have to assign integers to variables such 
that various given constraints on differences and distances (and Boolean combinations thereof) 
between variables are satisfied. We therefore call the class of CSPs whose template is locally finite 
and definable over (Z; succ) distance CSPs. 

In Section 6 we prove the following classification result for distance CSPs. 

Theorem 1 Let F be a locally finite structure with a first-order definition in {Z; succ) that is not 
homomorphically equivalent to a finite structure. Then either 

• The CSP for V is NP-complete. 
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• r has a modular median polymorphism ( see Section 5 ), and the CSP for V is in P. 

If a locally finite structure T with a first-order definition in (Z; succ) has a finite core, then 
a widely accepted conjecture about finite domain CSPs implies that the CSP for F is either NP- 
complete or in P. In fact, for this we only need the special case of the conjecture of Feder and 
Vardi [12] that states that the CSP for finite templates with a transitive automorphism group is 
either in P or NP-complete (see Section 7 for details). 

Our theorem shows that if F is not homomorphically equivalent to a finite core, then the CSP 
for F is NP-complete, or that F has a certain majority polymorphism, which we call modular median 
(defined in Section 5), and the CSP for F can be solved in polynomial time by local consistency 
techniques. Polynomial-time tractability results based on local consistency were previously only 
known for finite or ct;-categorical templates; we use the assumption that templates for distance 
CSPs are locally finite to extend the technique to non-w-categorical templates. 

On the way to our classification result we derive several facts about structures definable 
in (Z;s?icc), and automorphisms and endomorphisms of these structures, which might be of 
independent interest in model theory, universal algebra, and combinatorics. For example, we 
show that every injective endomorphism of a connected locally finite structure F with a first- 
order definition in (Z; succ) is either of the form x i— )■ —x -|- c or of the form x x + c for some 
c G Z. 

2 Preliminaries 

A finite relational signature r is a finite set of relation symbols Ri, each of which has an associated 
arity ki. A (relational) structure F consists of a set D (the domain) together with a relation 
Rf ^ D^^ for each relation symbol Ri from r. We consider only finite signature structures in this 
paper. 

For x,y G Z, let d{x,y) be the distance between x and y, that is, |x — y|. The relation 
{{x,y) I y = X + 1} is denoted by succ, and the relation {{x,y) \ d{x,y) = 1} is denoted by 
sym-succ. A k-ary relation R is said to be first-order (fa) definable in the r-structurc F if there is 
an fo-r-formula ^(xi, . . . , X}~) such that R = {{xi, . . . , Xk) : F |= (p{xi, . . . , x^)}. A structure A is 
said to be fo-definable in F if each of its relations is fo-definable in F. For example, (Z; sym-succ) 
is fo-definable in (Z; succ) (though the converse is false). 

The structure induced by a subset S of the domain of F is denoted by F[S']. We say that a 
structure is connected if it cannot be written as the disjoint union of two other structures. The 
Gaifman graph of a relational structure F with domain D is the following undirected graph: the 
vertex set is D, and there is an edge between distinct elements x,y E D when there is a tuple 
in one of the relations of F that has both x and y as entries. A structure F is readily seen to be 
connected if and only if its Gaifman graph is connected. The degree of a structure F is defined to 
be the degree of the Gaifman graph of F. The degree of a relation i? C Z'^ is defined to be the 
degree of the structure (Z; R). Throughout the paper, F will be a finite-degree relational striicture 
with an fo definition in (Z; succ). The notation (F, i?) indicates the expansion of F with the new 
relation R. 

An fo-formula @ is primitive positive (pp) if it is of the form 3xi, . . . ,Xi.9{xi, . . . ,Xi,Xi+i, 
. . . ,Xj) where 9 is a conjunction of atoms. Note that we consider the boolean false _L to be a pp- 
formula, and we always allow equalities in pp-formulas. A pp-sentence is a pp-formula with no free 
variables. Suppose F is a finite structure over a finite signature with domain D := {ai, . . . , as}. Let 
9rixi, . . . , Xs) be the conjunction of the positive facts of F, where the variables xi, . . . ,Xs correspond 
to the elements ai , . . . , . That is, R{xx^ , ■ ■ ■ , x\f. ) appears as an atom in 9y iff (oai i - ■ ■ i o-Xk ) ^ ■ 
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Define the pp-sentence 3xi . . . Xs-Ovixi, ■ ■ ■ , Xg) to be the canonical query of F. Conversely, for a 
pp-sentence G := 3xi . . . Xs-0(xi, . . . , Xg) we build the canonical database Tq to be the structure 
with domain {xi, . . . , Xg} of which 0{xi, . . . , Xg) lists the positive facts. 

For a structure F over a finite signature, CSP(F) is the computational problem to decide whether 
a given pp-sentence is true in F. It is not hard to sec that CSP(A) <p CSP(F) for any F and A 
with the same domain such that each of the relations of A is pp-definable in F (see [18]); here, <p 
indicates polynomial-time many-to-one reduction (in fact, logspace reductions may be used, though 
this is harder to see and requires the celebrated result of [20]). 

Let F and A be r-structures. A homomorphism from F to A is a function / from the domain of 
F to the domain of A such that, for each fe-ary relation symbol i? in r and each fc-tuple (ai, . . . , a^) 
from F, if (ai,...,afc) G , then (/(ai), . . . , /(a/t)) G R^. In this case we say that the map 
/ preserves the relation R. Injective homomorphisms that also preserve the complement of each 
relation are called embeddings. Surjective embeddings arc called isomorphisms; homomorphisms 
and isomorphisms from F to itself are called endomorphisms and automorphisms, respectively. The 
set of automorphisms of a structure F forms a group under composition. A (fc-ary) polymorphism 
of a structure F over domain D is a function / : — ?> D such that, for all m-ary relations R of F, 
if (ai,. . ■,al^) e R^, for ah i < k, then {f{a\,.. . ,aj),. . . ,/(a^,. . . ,aj^)) G R^ . 

A unary function g (over domain D) is in the local closure of a set of unary functions F (over 
domain D) if, for every finite D' Q D there is a function f E F such that g and /' agree on all 
elements in D' . We say the F generates / if / is in the local closure of the set F' of all functions 
that can be obtained from the members of F by repeated applications of composition. 

If there exist homomorphisms / : F ^ A and g : A ^ T then F and A are said to be 
homomorphically equivalent. It is a basic observation that CSP(F) = CSP(A) if F and A are 
homomorphically equivalent. A structure is a core if all of its endomorphisms are embeddings [1] 
- a core A of a structure F is an induced substructure that is itself a core and is homomorphically 
equivalent to F. It is well-known that, if a structure has a finite core, then that core is unique up 
to isomorphism (the same is not true for infinite cores). 

We could have equivalently defined the class of distance CSPs as the class of CSPs whose 
template is locally finite and first-order definable in (Z; s), where s is the unary successor function, 
since (Z; succ) and (Z; s) fo-define the same structures. The structure (Z; s) admits quantifier 
elimination; that is, for every fo-formula (j){x) there is a quantifier-free (qf) 4>'{x) such that (Z;s) 
1= Vx.4'{x) -H- (j)'{x) (this is easy to prove, and can be found explicitly in [13]). Thus we may have 
terms in (j)' of the form y = s^{x), where s^ is the successor function composed on itself j times. Let 
F be a finite signature structure, fo-definable in (Z; succ), i.e. qf-definable in its functional variant 
(Z;s). Let m be the largest number such that y = s™(x) appears as a term in the qf definition 
of a relation of F. Consider now CSP(F), the problem to evaluate ^ := 3xi, . . . ,xi^.(f)(xi, . . . ,Xk), 
where (/> is a conjunction of atoms, on F. Let S := {1, . . . ,k ■ (m + 1)}. It is not hard to see that 
F ^ $ iff r[S] ^ It follows that CSP(F) will always be in NP. 

3 Endomorphisms 

The main result of this section is the following theorem. 

Theorem 2 Let T be a relational structure with a first-order definition in (Z; succ) which has finite 
degree and which is connected. Then: 

• The automorphism group ofV equals either the automorphism group of {Z; succ), or that of 
(Z; sym-succ). 
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• Either T has a finite range endomorphism, or it has an endomorphism which maps T onto 
a subset of Z isomorphic to a structure fo- definable in (Z; succ) all of whose endomorphisms 
are automorphisms. 

The proof of this theorem can be found at the end of this section, and makes use of a series of 
lemmata. We assume henceforth that F is a relational structure with a first-order definition in 
(Z; succ) which has finite degree and which is connected. 

Before beginning the proof, we remark that although it is tempting to believe that the 
automorphism group of F equals that of (Z; sym-succ) iff F is fo-definable in the latter structure, 
this is not true: Let 

R := {{x, y, u, f ) G : (y = succ{x) Av = succ{u)) y {u = succ{y) A x = succ{y))}, 

and set F := (Z;i?). Clearly, F satisfies the hypotheses of Theorem 2. The function which 
sends every .t G Z to —x is an automorphism of F, so the automorphism group of F equals 
that of {Z; sym-succ), by Theorem 2. However, R is not fo-definable in {Z; sym-succ). To see 
this, suppose it were definable. For every positive natural number i, let sym-succ^ be the binary 
relation that says that the distance between two points equals i. Then R is also definable in 
{Z; sym-succ^ , sym-succ^ ,.. .), and even with a quantifier-free formula (f) since this structure has 
quantifier-elimination. Let n be the maximal natural number such that sym-succ^'' occurs in (p. 
Then a short moment of reflection suffices to see that 0(0, 1, ra-|-2, n-|-3) holds iff 0(0, l,n-\-3,n-\-2) 
holds, a contradiction since the former tuple is an clement of R whereas the latter is not. 

Denote by E the edge-relation of the Gaifman graph of F. It is clear that every endomorphism 
of F preserves E. We claim that there are < di < ■ ■ ■ < dn such that E{x, y) holds iff 
d{x, y) e {di ,dn}. To see this, observe that if x, y G Z are connected by E and u,v & Z are so 
that d{x, y) = d(u, v), then also u, v arc connected by E: This is because there is an automorphism 
of (Z; succ) (and hence of F) which sends {x, y} to {u, v} and this automorphism also preserves E. 
Hence, the relation E is determined by distances. Moreover, there are only finitely many distances 
since F is assumed to have finite degree. 

Definition 3 We will refer to the distances defining the Gaifman graph of F as di, . . . ,dn. We 
also write D for the largest distance dn- 

The following claim, very easy to prove, will be used a number of times. 

Claim 4 F is connected if and only if the greatest common divisor of di, . . . ,dn is 1. 

Proof: If d is the greatest common divisor of di, . . . , d„ it is clear that all the nodes accessible 
from a node x £ Z are of the form x-\-c-d where c G Z. Conversely, every node of the form x-\-c - d 
is accesible from x because c ■ d = ci ■ di -\- ■ ■ ■ -\- c^ ■ d^ for some ci, . . . , 

In order to lighten the notation we might use ex to denote e(x), where e is an endomorphism 
of F and x e Z. 

Lemma 5 Suppose that F is connected and of finite degree. Then there exists a constant c = c(F) 
such that for all endomorphisms eofVwe have d(e{x), e{y)) < d{x, y) + c for all x,y € Z. 

Proof: We first claim that for every < q < D, there exists a number Cq such that 
d(e{x),e{y)) < Cq for all endomorphisms e of F and all x, y G Z with d{x,y) = q. To sec this, 
pick u,v with d{u,v) = q and a path between u and v in the Gaifman graph of F; say this path 
has length Ig. Then, since this path is mapped to a path under any endomorphism, we have 
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d{e{u), e{v)) < D ■ Ig for all endomorphisms e. Since an isomorphic path exists for all x, y with the 
same distance, our claim follows by setting Cq := D ■ Ig. Set c to be the maximum of the Cg, and 
let an endomorphism e and G Z be given. Assume wlog that x < y. There exists m> and 
< q < D such that y = x + D- m + q. Set Xr := x + D - r, for all < r < m. Now 



Observe that a constant c(r) not only exists, but can actually be calculated given the distances 
di, . . . ,dn- In the following, we will keep the symbol c reserved for the minimal constant guaranteed 
by the preceding lemma. 

Lemma 6 Let e be an endomorphism ofT. If for all k > c + 1 there exist x, y with d{x, y) = k and 
d{e{x),e{y)) < k, then e generates a finite range operation whose range has size at most 2(c+ 1). 

Proof: Let ^ C Z be finite. We claim that e generates a function which maps A into a set of 
diameter at most 2c + 1. The lemma then follows by the following standard local closure argument: 
Let S be the set of all those functions a whose domain is a finite interval [— n; n] C Z and whose 
range is contained in the interval [— c;c], and which have the property that there exists a function 
generated by e which agrees with a on [—n; n]. By our claim, S is infinite. For functions a, j3 in S', 
write a < /3 iff /3 is an extension of a. Clearly, the set S*, equipped with this order, forms a finitely 
branching tree; since the tree is infinite, it has an infinite branch (this easily verified fact is called 
Konig's lemma) B C S. The branch B defines a function / from Z into the interval [— c;c]; since 
e generates functions which agree with / on arbitrarily large intervals of the form [— n;n], we have 
that / is generated by e, too. This completes the proof. 

Enumerate the pairs {x,y) € with x < y hy (xi, yi), . . . , (x^, j/r)- Now the hypothesis 
implies that there exists t\ generated by e such that d{t\X\,tiy\) < c + L Similarly, there exists t2 
generated by e such that ii(t2^ia^2) ^2^1^/2) < c + 1. Continuing like this we arrive at a function tr 
generated by e such that d(trtr-i ■ ■ ■ tix^^t^tr-i ■ ■ ■ tiyr) < c + 1. Now consider t := t^ o ■ ■ ■ oti. Set 
fj := tr o ■ ■ ■ o tj^i and gj := tjo ■ ■ ■ oti, for all 1 < j < r; so t = /j o gj . Then, since by construction 
d{9j{xj),gj{yj)) < c+1, we have that d{tXj,tyj) = d{fj{gj{xj)), fj{gj{yj)) < d{gj{xj),gj{yj)) + c < 
2c + 1 for all 1 < j < r, and our claim follows. ■ 

Lemma 7 // the hypothesis of the preceding lemma does not hold, i.e., if there exists k > c + 1 
such that d{ex, ey) > k for all x, y with d{x, y) = k, then either e{s + D) = e{s) + D for all s E Z 
or e(s + D) = e(s) — D for all s G Z. In particular, e does not generate a finite range operation. 

Proof: Let A; > c + 1 be so that d{ex,ey) > k for all x,y with d{x,y) = k. Let w G Z be 
arbitrary. Then, since d{e{w + k), e(w)) > k, we have e{w) ^ e{w + k); say wlog e{w + k) > e{w). 
Since d{w, w + k) = k we have e{w + k) > e{iv) + k. We claim that for all v G Z, e{v + k) > e{v) + k. 
Suppose not, and say wlog that there exists v > w contradicting our claim. Then, since 
d{e{v + k),e{v)) > k, we have e{v + k) < e{v) — k. Take the minimal v with v > w satisfying 
this property. Then, by minimality, we have e{v — 1 + A;) > e{v — 1) + k. Since by Lemma 5 
we have d{e{v — 1 + k),e{v + k)) < c+1, we get that e{v — 1)+A; — c— 1 < e{v + k). On 
the other hand, e{v) — c — 1 < e{v — 1). Inserting this into the previous inequality, we obtain 
e{v) — c— 1 + k — c— 1< e{v + k), which yields e{v) — 2c — 2 + A; < e{v + k). By our assumption 
on V, we obtain e{v) — 2c — 2 + A; < e{v) — k, which yields A < c + 1, a contradiction. 




d{exr, exr+i) + d{exm, ey) < D ■ m + d{exm, ey) 



0<r<m 



< d{x, y) + d{exm, ey) < d{x, y) + c. 
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Set b := k ■ D. We next claim that e{v + b) = e{v) + b for all t; € Z. Since 6 is a multiple 
of D and points with distance D cannot be mapped to points with larger distance, we get that 
e{v + b) < e{v) + b. On the other hand, since b is also a multiple of k and since e{v + k) > e{v) + k 
for all u G Z, we obtain e{v + b) > e{v) + 6, proving the claim. 

We now prove that e{v) + D < e{v + D) for all v € Z. This is because e{v) + kD = e{v) + b = 
e{v + b) = e{v + kD) = e{v + D + {k - 1)D) < e{v + D) + {k - 1)D, the latter inequality holding 
since D is the maximal distance in the relation E and cannot be increased. Subtracting {k — 1)D 
on both sides, our claim follows. 

Since D cannot be increased, we have e{v + D) < e{v) + D for all v G Z, and we have proved 
the lemma. I 

The following lemma summarizes the preceding two lemmas. 

Lemma 8 The following are equivalent for an endomorphism e ofV: 

(i) There exists k> c + 1 such that d{ex, ey) > k for all x, y G Z with d{x, y) = k. 
(a) e does not generate a finite range operation. 

(Hi) e satisfies either e{v + D) = e{v) + D or e{v + D) = e{v) — D. 

Proof: Lemma 7 shows that (i) implies (ii) and (iii). It follows from from Lemma 6 that (ii) 
implies (i). Finally, it is clear that (iii) implies (ii). ■ 
We know now that there are two types of endomorphisms of F: Those which are periodic with 
period D, and those which generate a finite range operation. We will now provide examples showing 
that both types really occur. 

Example 9 Let R := {{x,y) : d{x,y) = 1 V d{x,y) = 3}, and set F := {Z;R). Set e{3k) := 3k, 
e(3k + 1) := 3A; + 1, and e{3k + 2) := 3k, for all /c G Z. Then e is an endomorphism ofV that does 
not generate any finite range operations since it satisfies e{v + 3) = e{v) + 3 for all v E Z. 

Observe that in the previous example, we checked that e is of the non-finite-range type by virtue 
of the easily verifiable Item (iii) of Lemma 8 and without calculating c(F) , which would be more 
complicated. 

Example 10 For the structure F from Example 9, let e be the function which maps every x G Z 
to its value modulo 4. Then e is an endomorphism which has finite range. 

Example 11 Set R := {{x,y) : d{x,y) G {1,3,6}} and S := {{x,y) : d{x,y) = 3}. Then 
F := (Z; R, S) has the endomorphism from Example 9. However, it does not have any finite 
range endomorphism. To see this, consider the set Z3 := {3m : m G Z}. If e were a finite 
range endomorphism, it would have to map this set onto a finite set. Assume wlog that e(0) = 
and e(3) > 0. Then e(3) = 3 as e preserves S. We claim e{s) = s for all s G Z3. Suppose to 
the contrary that s is the minimal positive counterexample (the negative case is similar). We have 
e{s — 3) = s — 3 and hence, as e preserves S, e(s) G {s — 6, s}. // we had e{s) = s — 6, then 
e(s — 6) = s — 6 and {s — 6, s) € R yields a contradiction. 

Example 12 Let F = {"L; sym-succ), and let e be the function that maps every x to its absolute 
value. Then e does not have finite range, but does generate a function with finite range (of size 2). 

The proof of Lemma 7 generalizes canonically to a more general situation. 
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Lemma 13 Let e be an endomorphism of T satisfying the various statements of Lemma 8. Let q 
be so that d{x, y) = q implies that d{ex, ey) < q. Then e satisfies either e{v + q) = e{v) + q for all 
V E. 7,, or e{v + q) = e{v) — q for all v E Z. 

Proof: This is the same argument as in the proof of Lemma 7, with D replaced by q. ■ 

Definition 14 Given an endomorphism e of T, we call all positive natural numhers q with the 
property that e{y + q) = e{v) + q for all v Eli or e{v + q) = e{v) — q for all v & Z stable for e. 

Observe that if e satisfies the various statements of Lemma 8, then D is stable for e. Note 
also that if p, q are stable for e, then they must have the same "direction" : We cannot have 
e{v + p) = e{v) + p and e{v + q) = e{v) — q for all v eTL. 

Lemma 15 Let e satisfy the various statements of Lemma 8, and let q be the minimal stable 
number for e. Then the stable numbers for e are precisely the multiples of q. In particular, q 
divides D. 

Proof: Clearly, all multiples of q are stable. Now for the other direction suppose that p is 
stable but not divisible by q. Write p = m ■ q + r, where m, r are positive numbers and 
< r < q. Since r is not stable, applying e and shifts we can find a term t such that t{0) = and 
d{t{mq),t{p)) 7^ r. By the property of p we should have t{p) = p. But this is impossible since then 
d{t{mq),t{p)) = d{mq,p) = r, a contradiction. ■ 

Lemma 16 Let e be an endomorphism of V satisfying the statements of Lemma 8. Let q be its 
minimal stable number. Then e can be composed with automorphisms of (Z; succ) to obtain an 
endomorphism t with the following properties: 

• t satisfies either t{v + q) = t{v) + q or t{v + q) = t{v) — q 

• t{0) = 

• t[Z] = {q- z : z eZ}. 

Proof: Assume 1 < q (otherwise t can be chosen to be the identity and there is nothing 
to do). We claim that e generates a term ti such that ti(0) = and G {g • z : z G Z}. 

To see this, observe that since 1 < q and since q is the smallest positive number with the 
property that d{x,y) = q implies d{ex,ey) < q, there exist xo,yo G Z with d{xo,yo) = 1 and 
d{exo,eyo) > 1. Write ri := d.{exo, eyo). If ri is not a multiple of q, then there exist xi,yi £ Z 
with d{xi,yi) = ri and d{exi, eyi) =: r2 > ri. Again, if r2 is not a multiple of q, then there 
exist X2,y2 £ Z with d{x2,y2) = ^2 and d{ex2,ey2) =■ r^ > r2- Consider the sequence {xi,yi) of 
pairs of distance (setting ro := 1). By exchanging Xj+i and y^+i if necessary, we may assume 
that Xj+i < i/i+i iff exi < eyi, for all i. There exist automorphisms ai of (Z; succ) such that 
{ai{e{xi)), ai{e{yi))) = (xj+i, yj+i). Set Sj := ctj o e o Q!.j_i o ■ ■ ■ o ao o e. Then the endomorphism 
Si sends {xo,yo) to (xj+i,yj+i), a pair of distance rj+i > > • • • > tq. Thus the sequence must 
end at some finite i. By construction of the sequence, this happens only if r^+i is a multiple of q. 
Therefore, r^+i = d{si{xo), Si{yo)) G {q ■ z : z E Z}. By applying shifts we may assume xq = 0, 
yo = 1, and Sj(0) = 0. Set ti := Sj. 

Now if 2 < g, then consider the number ti(2). We claim that e generates a term t2 such that 
^2(0) = and t2{ti{2)) is a multiple of q. If already ti(2) is a multiple of q, then we can choose t2 
to be the identity. Otherwise, we can increase the distance of ti{2) from successively by applying 
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shifts and e just as before, where we moved away 1 from 0. After a finite number of steps, we arrive 
at a term t2 such that d(i2(0), ^2^1(2)) is a multiple of q. Applying a shift one more time, we may 
assume that t2(0) = 0, and so t2 has the desired properties. 

We continue inductively, constructing for every i < q a term ti such that UiO) = and 
ti o ■ ■ ■ o ti{i) is a multiple of q. At the end, we set t := o ■ ■ ■ o ti. Since e satisfies either 
e{v + q) = e{v) + q 01 e{v + q) = e{v) — q, so does t, as it is composed of e and automorphisms of 
(Z; succ). It is also clear from the construction that t{0) = holds. These two facts together imply 
that t[Z] contains the set {q ■ z : z & Z}. For the other inclusion, let t; G Z be arbitrary, and write 
V = q ■ z + r, where z G Z and < r < g'. Then t{v) = q ■ z + t{r) or t{v) = —q ■ z + t(r), which is a 
multiple of q since t{r) is a multiple of q by construction. ■ 

Observe that we did not need local closure in the preceding lemma. 

Lemma 17 Let e be an endomorphism ofT which is not an automorphism o/(Z; sym-succ). Then 
e is not injective. 

Proof: If e generates a finite range operation then the lemma follows immediately, so assume 
this is not the case. Then e has a minimal stable number q. Since e is not an automorphism of 
(Z; sym-succ), we have q > 1. But then the statement follows from the preceding lemma, since the 
function t is not injective. ■ 

Lemma 18 Let e be an endomorphism of F which is not an automorphism of (Z; sym-succ) and 
which does not generate a finite range operation. Then e is not surjective. 

Proof: This is a direct consequence of Lemma 16, since being surjective is preserved under 
composition. ■ 



Proof: (of Theorem 2) Wc prove the first statement. It is a direct consequence of Lemma 17 
that the automorphism group of F is contained in that of (Z; sym-succ). Since F is fo-definable 
in {Z;succ), its automorphism group contains that of (Z;succ). The statement now follows from 
the easily verifiable fact that there are no permutation groups properly between the automorphism 
groups of (Z; succ) and (Z; sym-succ). 

For the second statement, suppose that F has no finite range endomorphism. If all of its 
endomorphisms are automorphisms, then we are done. Otherwise, F has an endomorphism t as in 
Lemma 16, with q > 1. Consider the substructure A of F induced on the image t[Z] = {q-z : z G 
of t. We claim that A is isomorphic to a structure A' with domain Z which has a first-order 
definition in (Z; succ). Indeed, let R be any relation of F, and let (p be the formula defining R in 
(Z; succ). For all i £ 00, let sued be the binary relation on Z which says about a pair {x,y) G Z^ 
that y = X + i. Then adding the succ^ to the language, (j) can be assumed to be quantifier- free. 
Now construct a formula (j)' as follows: For all i € t^J not divisible by q, replace every occurrence 
of sued by Va;(x 7^ x). For all other i, replace every occurrence of sued by succi . Let R be the 
relation defined by (f)' on Z. Then one readily checks that (t[Z],i?) (where R is restricted to the 
domain t[Z]) is isomorphic to (Z;i?') via the isomorphism which sends every x G t[Z] to |. Thus, 
defining A' to have exactly the relations of the form R' , where i? is a relation of F, we get that A' 
is indeed isomorphic to A. Clearly, A' is fo-dcfinable in (Z; succ). 

Since A is the image of an endomorphism of F, one readily checks that the Gaifman graph of A 
coincides with the induced subgraph of the Gaifman graph of F on t[Z]. Thus in A, two points x, y 
are adjacent iff d{x,y) G {di, . . . dn}; moreover, d{x,y) is divisible by q. Therefore, the remaining 
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relevant distances are those divisible by q. In other words, if {d,^, . . . , dj^} are those distances from 
{di, . . . , dn\ which are divisible by then the Gaifman graph of A' is isomorphic to the graph on 
Z defined by the distances {^, . . . , ^}. Since before, the greatest common divisor of all possible 
distances was 1, wc must have lost at least one distance, i.e., r < n. 

Observe that A (and hence A') is connected as it is the image of an cndomorphism of T. Note 
moreover that A cannot have a finite range endomorphism: If s were such an endomorphism, then 
sot would be a finite range endomorphism for F, contrary to our assumption. If all endomorphisms of 
A are automorphisms, then we are done. Otherwise A (more precisely, A') satisfies all assumptions 
that we had on T, and we may repeat the argument. Since in every step we lose a distance for 
the Gaifman graph, this process must end, meaning that we arrive at a structure all of whose 
endomorphisms are automorphisms. ■ 



4 Definability of Successor 

In this section we show how to reduce the complexity classification for distance constraint 
satisfaction problems with template F to the case where either F has a finite core, or the relation 
succ is pp-definable in F. We make essential use of the results of the previous section; but note 
that in this section we do not assume that F is connected. 

Theorem 19 Every finite degree relational structure F with a first-order definition in (Z; succ) is 

either homomorphically equivalent to a finite structure, or to a connected finite-degree structure 
A with a first order definition in (Z; succ) which satisfies one of two possibilities: CSP{A) (and, 
hence, CSP{r)) is NP-hard, or succ is definable in A. 

The following lemma demonstrates how the not necessarily connected case can be reduced to 
the connected case. 

Lemma 20 Every finite degree relational structure F with a first-order definition in (Z; succ) is 
homomorphically equivalent to a connected finite-degree structure A with a first order definition in 
(Z; succ). 

Proof: If the Gaifman graph of F does not contain any edges, then the statement is clear. 
Otherwise, let g be the greatest common divisor oi di, . . . ,dn (the distances in the Gaifman graph, 
see Section 3, Definition 3). If F is connected, there is nothing to prove. 

Otherwise, if F is disconnected, we have g > 1. Then F must be a disjoint union of g copies of 
a connected structure A (and these copies are isomorphic to each other by an isomorphism of the 
form X X + d, for appropriate constant d). In particular, F is homomorphically equivalent to A. 
Moreover, we claim that A itself has a first-order definition in (Z; succ). The proof here is as in the 
proof of Theorem 2, with g taking the role of g. ■ 

The following is obvious. 

Lemma 21 Let (ai, . . . , ak), (6i, • • • , bk) £ Z'^. Then there is an automorphism a of (Z; succ) with 
a{ai) = bi for all i < k if and only if ai — Uj = bi — bj for all 1 < i, j < k. 

Lemma 22 Suppose that F is connected. Then there is an no such that F[{1, . . . ,n}] is connected 
for all n > no . 
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Proof: Let di be the smallest distance of the distances {di, d„} defining the Gaifman graph G 
of r (as in Section 3). By connectivity of G, for each pair a,b of elements from {!,... ,di} there 
is a path from o to 6 in G. Fix such a path for each pair a,b. Let no be the smallest number 
such that all vertices on those paths are smaller than no- We claim that r[{l, . . . , n}] is connected 
for all n > uq. To sec that c,d < n are connected, observe that both c and d are connected to 
vertices in {1, . . . , di} (via a sequence of vertices at distance di). Since all vertices in {1, . . . ,di} 
are connected in r[{l, . . . ,no}] by construction, we conclude that c and d are connected by a path 
inr[{l,...,n}]. ■ 

Lemma 23 Suppose that F is connected and of finite degree. Then there is an Hq and c such that for 
all n> no and any homomorphism f from r[{l, . . . , n}] to T we have that d{f{x),f{y)) < c+d{x, y) 
for all X, y G {1, . . . , n}. 

Proof: Let no be the number from Lemma 22. Then for all n > no, the structure r[{l, . . . , n}] is 
connected. Now, proceed as in Lemma 5. ■ 

Proposition 24 Let T he a connected finite-degree structure with a first-order definition in 
(Z;succ). Assume that every endomorphism of F is an automorphism of {Z; sym-succ). Then 
for all ai, 02 G Z there is a finite S C Z that contains {01,02} such that for all homomorphisms f 
from F[S] to F we have d{f{ai),f{a2)) = 0^(01,02). 

Proof: Suppose that there are oi < 02 G F such that for all finite subsets S of elements of F 
that contain {01,02} there is a homomorphism from T[S] to F where (i(/(ai), /(02)) 7^ ^(01,02). 
We have to show that F has an endomorphism that is not an automorphism of (Z; sym-succ). Let 
5 be a subset of Z that contains {01,02}, and let f,g be functions from — >■ Z. Then we define 
/ ~ 5 if there exists an automorphism a of F such that f{x) = a{g{x)) for all x E S. We call such 
a function good if it is a homomorphism from T[S] to F where (i(/(ai), /(02)) 7^ ^(01,02). Observe 
that since all automorphisms of F preserve distances, if one function in an equivalence class is good, 
then all other functions in the equivalence class are also good. 

Let no be the number from Lemma 23, and let ni be max(no, |oi|, I02I). Consider the following 
infinite forest T: the vertices are the equivalence classes of good functions / : y — >■ Z for 
V = {— n, . . . , n}, for all n > ni, and T has an arc from one such equivalence class F to another 
H if there are f & F, h & H, such that / is a restriction of h, and / is defined on {— n, . . . , n}, and 
h is defined on {— n — 1, . . . , n + 1}, for some n G N. Observe that 

• by our assumptions the forest T is infinite; 

• by Lemma 23, for every n > ni there is a 6 such that d{f{x),f{y)) < b for all x,y & 
{— n, . . . , n}. Using Lemma 21 it follows that T is finitely branching; 

• the forest T has only finitely many roots. 

By Konig's tree lemma, there is an infinite branch in T. It is straightforward to use this infinite 
branch to construct an endomorphism / of F with (i(ai, 02) 7^ d(/(oi), /(02)). This endomorphism 
cannot be an automorphism of (Z; sym-succ), which concludes the proof. ■ 

Corollary 25 Suppose that T is a connected finite-degree structure with a first-order definition 
in (Z;s'ucc), and suppose that all endomorphisms of F are automorphisms ofT. Then either the 
relation sym-succ^ = {{x,y) \ d{x,y) = k} is pp-definable in F for every k > 1, or the relation 
sued' = {{x, y) \ X — y = k} is pp-definable in F for every k > 1. 
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Proof: First consider the case that F is preserved by the unary operation x i— t- — x, and let 
A; > 1 be arbitrary. Let 01,02 be any two elements of Z at distance k. Since all endomorphisms 
of F are automorphisms of F, they are automorphisms of (Z; sym-succ) by the first statement of 
Theorem 2. Hence we may apply Proposition 24, and there is a finite set 5 C Z such that every 
homomorphism / from T[S] to F satisfies d(/(ai), f{a2)) = ^(01,02). Let (j){ai, 02) be the primitive 
positive formula obtained from the canonical query for T[S] by existentially quantifying all vertices 
except for a\ and 02- We claim that (?!) is a pp-definition of sym-succ^ = {{x,y) \ d{x,y) = k}. 

The relation defined by (f) contains the pair (01,02) (since the identity mapping is a satisfying 
assignment for the canonical query F[S']), and since F is preserved by all automorphisms of 
{7,; sym-succ) it also contains all other pairs {x,y) £ 1? such that d{x,y) = k = ^(01,02). 
Conversely, (f) does not contain any pair {x,y) with d{x,y) ^ k. Otherwise, there must be a 
assignment f : S ^ Z that satisfies the canonical query and maps oi to x and 02 to y. This 
assignment is a homomorphism, and therefore contradicts the assumption that (i(/(ai), /(02)) = 
d(ai,a2)- This proves the claim. 

Now consider the case that F is not preserved by the unary operation — . As before we use 
Theorem 2 and Proposition 24 to construct a primitive positive formula (j). This time it is easy to 
see that defines the relation {{x,y) \ x — y = k}. ■ 

Proposition 26 Suppose that for all k the relation sym-succ^ = {{x,y) G Z^ | d{x,y) = k} is 
pp-definable in F. Then CSP{r) is NP-hard. 

Proof: Observe that the primitive positive formula 3y.d{x,y) = lAd{y, z) = 5 defines the relation 
R = {{x,z) I d{x,z) € {4,6}}. The structure ["L^R) decomposes into two copies of the structure 
(Z; 5) where S = {(x, y) | d{x,y) G {2,3}}. This structure has the endomorphism x 1-^ x mod 5, 
and the image induced by this endomorphism is a cycle of length 5, which has a hard CSP (this is 
well-known; for a much stronger result on undirected graphs, see Hell and Nesetfil [14]). ■ 

Proof: (of Theorem 19) By Lemma 20, we can assume without loss of generality that F is 
connected. Clearly, if F has a finite range endomorphism, then it has a finite core. Otherwise, 
by Theorem 2, there is an endomorphism of F that maps F onto a subset of Z isomorphic to an 
induced substructure A of F which is first-order definable in (Z; succ), and where all endomorphisms 
are automorphisms. Being the homomorphic image of the connected structure F, A must also 
be connected. We now apply Corollary 25 to A. If the relation {{x,y) \ d{x,y) = k} is pp- 
definable in A for every A; > 1, then CSP(F) (which is equal to CSP(A) since F and A are 
homomorphically equivalent) is NP-hard by Proposition 26. Otherwise, by Corollary 25, the relation 
{(x, y) \ x — y = k}, and in particular the relation succ is pp-definable in A. ■ 

5 The Power of Consistency 

All tractable distance constraint satisfaction problems for templates without a finite core can be 
solved by an algorithmic technique known as local consistency. We prove these tractability results 
in this section. 

A majority operation on a set X is a mapping / : — >■ X satisfying 

/(x, X, y) = f{x, y, x) = f{y, x,x) = x . 

An n-ary relation i? on a set X is 2- decomposable if R contains all n-tuples (ii, . . . ,tn) such 
that for every 2-element subset / of {1, . . . , n} there is a tuple s G i? such that ti = Si for all i £ I. 
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We need the following concept to prove the algorithmic results in this paper. Let A be a 
structure with a (not necessarily finite) relational signature r, and let ^ be a conjunction of atomic 
r-formulas with variables V. For A; > 0, we say that is k-consistent (with respect to A) if for 
every assignment ct of A; — 1 variables xi, . . . , Xk-i € ^ to elements from A and for every variable 
Xk EV the assignment a can be extended to such that all conjuncts of (j) that involve no other 
variables than satisfied over A by the extension of a. We say that (f) is strongly 

k-consistent if 4> is j-consistent for all j with 2 < j < k. We say that (f) is globally consistent if is 
A;-consistent for all /c > 0. 

The following has been shown in [17] (with an explicit comment in Section 4.4 that the result 
also holds on infinite domains). 

Theorem 27 (Special case of Theorem 3.5 of [17]) Let T be a structure with a majority 
polymorphism. Then every relation RofTis 2- decomposable. Moreover, every strongly 3-consistent 
conjunction of atomic formulas is also globally consistent with respect to F. 

In the proof of the following proposition, and in the next section, it will be convenient to 
represent binary relations R C. 7? with a first-order definition in (Z; succ) by sets of integers as 
follows: the set S represents the binary relation Rs := {{x,x + k) \ k € S}. Conversely, when Ris a 
binary relation with a first-order definition in (Z; succ), let S{R) be the set such that Rs(r) = R- It 
is easy to see that every binary relation of finite degree and with a first-order definition in (Z; succ) 
is of the form Rs for some finite S. 

Theorem 28 Let T be a finite degree structure with a first-order definition in (Z; succ). IfF has 
a majority polymorphism, then CSP{r) is in P. 

Proof: Let D be the largest distance in the Gaifman graph of T, as in Section 3 (Definition 3). 

Let $ be an input instance to CSP(r) with variables V = {xi, . . . , ,x„}. and let (j) be the quantifier- 
free part of Our algorithm and its correctness proof work independently for each connected 
component of the canonical database of (j). So let us assume that the canonical database of ^ is 
connected. 

Let A be the expansion of T by all binary relations R with a pp-definition in F such that R = Rs 
for S C {—nD, . . . , nD}. Moreover, A contains a binary relation symbol F that denotes Rz = Z^. 
Let T be the signature of A. Since all relations of A are pp-definable in F, it follows that A has 
the same majority polymorphism as F. 

We claim that we can compute from ^ a r-formula with variables V such that 

• (f) implies ip, and 

• 4> All) IS strongly 3-consistent with respect to A 

in time polynomial in the size of (j). This can be done as follows. 

The algorithm maintains for each pair of variables x^^xi with k,l G {l,...,n} and k ^ I a 
binary relation symbol P = P^^'O from r. The set S{P^) either equals Z or it is a subset of 
{—nD, . . . ,nD}; in the latter case, we use this subset to represent the relation in the algorithm. 

We first describe how to initialize the Let k,l € {l,...,n} be distinct. If x^ and xi 

are non-adjacent in the Gaifman graph of the canonical query for 0, then we set 
Otherwise, there is a conjunct T{xi^, . . . ,Xi^) of (f) with T a relation symbol from F such that 
k,l € {ii, . . . , im}- For the sake of notation, suppose that ij = j for all 1 < j < ?ti, and that k = 1 
and I = 2. Let R{xi,X2) be the relation with the pp-definition 3x3, ■ ■ ■ ■> Xm-T{xi, . . . , Xm) over F. 
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Clearly, S{R) C {-D, ...,D}. Hence, R is & relation from A. We then set P(^'^) to R. Note that 
R{xi,X2) is implied by cp in A. 

We now describe how to obtain stronger and stronger consequences of ^ by local propagation. 
Let Xk,Xi,Xra be three distinct variables from (f). Consider the case k — 1) ^ — 2, and 
m = 3, again for the sake of notation. Then let R{xi,xs) be the binary relation defined by 
3x2. P^^'^\xi, X3) A P^^''^\xi,X2) A P^'^'^\x2,X3) over A. It is straightforward to verify that R is 
pp-definable in T. We will show below that either S{R) C {—nD, ... , nD} or S{R) = Z. Hence, R is 
a relation from A and we replace p(^'^) by R. We call this replacement step proper if R was different 
from p(^'^). Again, note that R{xi,x^) is implied by (p in A. Also note that a representation of R 
can be computed from the representations of P^^'^^, p(2'3), and P^^'^^ in polynomial time. Moreover, 
S{R) C 5(P(1'3)). 

We perform such replacements until for each k, I the binary relation p(*^'') cannot be changed 
further by the above replacement steps. It is clear that we will reach this state after an at most 
cubic number of proper replacement steps: the reason is that when a relation that is represented 
by a subset S of {—nD, . . . ,nD} is replaced, the representation of the replacing relation will be a 
proper subset of S. Since there are only quadratically many relations where replacements can be 
made, the claim follows. 

We still have to show that S{R) C {—nD, . . . ,nD} or S(R) = Z for R{xi,X3) defined by 
3x2- P^^'^^{xi,X3) A P^^'^^Xi,X2) A P(^'^)(x2, Xs) as above. The proof is by induction on the 
distance I of xi and X3 in the Gaifman graph G of the canonical database of 4>; in fact, we show the 
stronger claim that cither S{R) = Z or S{R) C {—ID, . . . ,ID}. If the distance is 1, then already 
by the initialization we have that ^(P^^'^)) C {-D,...,D}, and hence S{R) C {-D,...,D}. 
If S{P^^'^^) C {—ID, . . . ,ID}, then R C {—ID,..., ID} and we are done; so assume that 
5'(p(i'3)^ = Z. Let /i be the distance between xi and X2, and I2 be the distance between X2 
and X3 in G. If ^(P^i'^)) = Z or 5(P(2'3)) = Z then S{R) = Z and we are done. Otherwise, by 
inductive assumption we have that ^(P^^'^)) C {-hD, hD} and S{P^^'^^) C {-I2D, I2D}, 
and because I < h + I2 it follows that S{R) C {—ID, . . . , ID}. 

To conclude the proof, let ip be the formula /\^ i^^^ „| P^^'''\xk,xi). It is straightforward 
to verify that A is 3-consistcnt with respect to A. Clearly, it is also 2-consistent. Since A has 
a majority polymorphism. Theorem 27 implies that ^ A ^ is globally consistent. 

When P^'^'O = for some k, I then ip is unsatisfiable in A; and $ is false in F. Otherwise, global 
consistency implies that the sentence . . . ,Xn- </> A ■0 is true in A, since we can map xi to an 
arbitrary value in Z, and then successively extend this map to a mapping that satisfies (p Aifj. It is 
clear that in this case $ is true in F as well. This shows that truth of a given formula $ in F can 
be decided in polynomial time. ■ 

Definition 29 The d-modular median is the ternary operation nid : Z'^ — > Z defined as follows: 

• If x,y,z are congruent modulo d, then md{x,y, z) equals the median of x,y,z. 

• If precisely two arguments from, x, y, z are congruent modulo d then md{x, y, z) equals the first 
of those arguments in the ordered sequence {x,y,z). 

• Otherwise, md{x, y, z) = x. 

Clearly, c?-modular median operations are majority operations. 

Corollary 30 Let T he a finite-degree structure with a first- order definition in (Z; succ) and a finite 
relational signature, and suppose that F has a modular median polymorphism. Then CSP{r) is in 
P. 
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6 Classification 



In this section wc finisli tlic complexity classification for those T that do not have a finite core. The 
main result of Section 4 shows that, unless T has a finite core, for the complexity classification of 
CSP(r) we can assume that the structure F contains the relation succ. In the following we therefore 
assume that the structure T contains the relation succ; moreover, we freely use expressions of the 
form X — y = d, for fixed d, in primitive positive definitions since such expressions have themselves 
pp-definitions from succ and therefore from T. Our main result will be the following. 

Theorem 31 Let T be a first-order expansion of [Jj-^succ). Then V is preserved by a modular 
median and CSP{r) is in P, or CSP{r) is NP-hard. 

A d'progression is a subset of Z of the form {k, k + d, . . . ,k + Id}, for some k,l E Z. We shall 
denote {k,k + d, . . . ,k + Id} by [k, k + ld]d- 

Proposition 32 Let R C. TP be a finite-degree binary relation with a first-order definition in 
(li-jSucc). Then the following are equivalent. 

1. R is preserved by the d-modular median m^; 

2. R = Rg for a d-progression S. 

Proof: (1^2). If 5 is not a d-progression then there exist some a < b E S such that x ^ S 

for all a < X < 6 and b — a ^ d. If a — 6 is not multiple of d then fix v to be any integer smaller 
than min(S'), otherwise fix v to be a + d. Let u be such that {u,v) G Rs- If we apply rud to 
{u,v), (0, a), (0,6) we obtain {0,v) which does not belong to Rs- 

(2 ^ 1). Let {xi,yi),i = 1,2,3 be arbitrary tuples of Rs- For every i,j G {1,2,3}, 
Xi — Xj = yi — yj mod d- This implies that if a rule of the Definition 29 is applied to the Xj's 
then the same rule is applied to the y^'s. Then we only need to care about the first rule since the 
other two act as a projection. Let {xa, yb) be the result of applying the median to {xi, y^), i = 1, 2, 3 
and let m (resp. M) be the minimum (resp. maximum) integer n such that (x^, n) G Rs- Since S is 
a d-progrcssion, (xa, yj) belongs to Rs if — = mod d and m < yb < M. The first condition 
follows from the fact that we apply the first rule. The second condition is shown as follows. By 
the definition of median there is some / G {1,2,3}, I a, such that xi > Xa- Hence we have that 
yi and ya are at least m, which implies that the median of the second coordinates, yb, also is. A 
symmetric argument shows that yb < M. 

■ 

Proposition 33 Let a,b be two odd numbers such that a < b. Then CSPiX; succ,R^Q^a,b,a+b}) 
NP-hard. 

Proof: Let k be the integer Note that the primitive positive formula 

(l){x,z) = ^y-R{o,a,b,a+b}ix,y) Ay-z = k 

defines the relation C := {ix,z) | d{x,z) G {^^,^^}}- Consider the mapping / : Z — >■ 
{0, 1} defined by f{x) = x mod 6. It follows from = — mod 6 that / is a 

endomorphism of C. It also follows by the same reason that the restriction, D, of C to {0, . . . ,5—1} 
is a graph where every node has two edges. Furthermore if m is gc d(^, ^) then D is the disjoint 
union of m cycles of ^ nodes. Since ^ is odd we have that CSP(Z;C) is NP-hard (this follows 
from [14]). 
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Lemma 34 Let a,b,c with b ^ c. Then CSP{2,\ suae, R{a,b}-,R{a,c}) is NP-hard. 

Proof: First observe that the formula 3w. ^) ^ ^ = + « pp-defines the relation -R{o,6-o}; 

similarly, there is a pp-definition of i?{o,c-o} Let d = b — a and e = c — a; we will show that 

CSP(Z; succ, i?{o,d}) -^{o,e}) is NP-hard. 

The relation defined by 3u,v. R^Q^cij{x,u) A i?{o,e}(^^; ^) is -R{o,d,e,d+e}- If both d and e are 
odd, we obtain hardness of the CSP from the previous proposition applied to (Z; succ, R{o^d,e,d+e}) ■ 
If both d and e are even, then the structure A := (7^; R^q ,^^, R^q ,,^, {{x,y) \ x — y = 2}) is pp- 
dcfinable in P. The structure A is isomorphic to the disjoint union of two copies of the structure 
(Z; succ, R{o^d/2}j R{o,e/2})] claim now follows by induction on e. 

Finally, assume that precisely one of d or e is even; say d is even. Set u := lcm{d,e)/d and 
V := lcm{d,e)/e. The formula 

...,yu,zi,.. .,zy. R{Q,d}{p,yi) A R{Q,d}{yi,y2) a • • • a R{o,d}{yu-i,yu) a R{o,d}{yu,q) 

Ai?{0,e}(P, 21 ) A i?{o,e}(2l, 22) A • • • A R^o^e}{Zv-l, Zy) A R{o,e}{Zv, Q) 

with free variables p and q defines R{o,icni{d,e)}- 

We are now again in the case that we can pp-define two relations R{o,g} and R{o,h} fo^' even g, h 
(namely, g = d and h = lcm{d, e)), and thus we are done. ■ 

Lemma 35 Let S be a finite set of integers with \S\ > 1 and let d be the greatest common divisor 
of all a — a' with a, a' G S. Assume (this assumption is only for ease of notation) that all elements 

of S are of the form i ■ d where i ^TL. Let md = min(S'), Md = max(5'), let [jd, kd]d be a maximal 
d-progression in S, let I be such that I > max(j — m — 1, M — k — 1,0) and such that k > j + I. 
Then every d-progression with r = k — j — l + l elements is pp-definable in (Z; succ, Rs)- 

Proof: For every < i < /, let Ri{x, y) be the pp-formula 3z.(z = x + id) A Rs{z, y) which can be 
built using Rs and succ. Finally let Rt be the relation defined by Ri{x,y). We claim that 

T is precisely [{j + l)d, kd]d- 

We have T C 5 because the formula contains Ro{x, y). Let x = nd be any element of S. Let us 
do a case analysis. 

1. Case m < n < j — 1. The smallest y such that Rj-rn-i{0,y) holds is {j — l)d. Hence in this 
case X r. 

2. Case j — 1 < n < j + 1. By the maximality of [jd, kd]d it follows that {j — l)d S. Henceforth 
we have that Ri{0, nd) does not hold if we pick i = n — j + 1. Hence x ^T. 

3. Case j + I < n < k. For every 1 < i < Z, we have that Ri{0, x) holds aiSj + i<n<k + i. 
This implies that x eT. 

4. Case k < n < M. By the maximality of [jd, kd]d we have that {k + l)d ^ S. Hence by 
choosing i = n — {k + 1) we have that Ri{0, x) does not hold. Consequently x ^T 

Finally, it is obvious that if T is a cZ-progression with r elements, then Rt can be defined by 
the pp-formula 3z.{z = x + p) A R[(j+i)d,kd]ai^j y) by choosing p = max(T) — kd. ■ 

Let us illustrate the construction of Rt in the previous proof with an example. Assume S is 
the set {1, 4, 5, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 18, 19, 20} which we can represent as: 

I 1 I 2 I 3 I 4 I 5 I 6 I 7 I 8 I 9 I 10 I 11 I 12 I 13 I 14 I 16 I 16 I 17 I 18 I 19 I 20 I 21 I 22 I 23 I 24 I 25 I 26 I 
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We have d = 1. Consider the 1-progression [7, 16]i m S. Then we have m = 1, M = 20, j = 7, 
and k = 16. Fix I = 6. For every < i < I let Zi such that = Ri- Then we have: 
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Lemma 36 Let S he a finite set of integers with \S\ > 1 and let d be the greatest common divisor 
of all a — a' with a, a' G S. For any d-progression T, the relation Rt is pp-definable in (Z; suae, Rs) ■ 

Proof: The set of maximal ci-progressions contained in S can be totally ordered by setting 
Ti < Ts if min(ri) < min(T2). If Ti < Ta then we define the distance from Ti to T2 to be 

min(r2) — max(Ti). 

A d-progression is non-trivial if it contains at least two elements. For any m > 1, let (Rs)"^ 

m 

, ^ 

be the relation Rg o Rs o ■ ■ ■ o Rg which we can write as Rs"i where 5"* contains all integers that 
we can express as ai + • • • + Um with ai, . . . , € S. By the definition of d it follows that if m 
is large enough there exists some integer a, such that {a, a + d) C. S, or, in other words, that S'^ 
contains a non-trivial d-progression. For ease of notation we shall assume that S already contains 
a non-trivial d-progrcssion (otherwise replace S by 5""). 

For any m > 1, let be the maximum distance between two consecutive non-trivial maximal 
d-progressions contained in S"". Also, let (resp. l^) be minimum (resp. maximum) with the 
property that {min(S'™) + /^,min(5™) + |- +d} C S"^ (resp. {max(5™) -1+ -d, max(5™) - /+ } C 
S'^). Finally define Im to be max(/~,Z+). For ease of notation we shall write = 3,1^= l~ and 
so on. 

Claim 1. I2 < I- Proof: Follows from the fact that {2 min(S') + l~ ,2 min(5') -|- /~ -|- d, 2 max(S') — 
Z+ -d,2max(5) -/+} C S^. 

Claim 2. If ^ = then 712 < n. 

Proof: Let X < y be consecutive non-trivial maximal d-progressions contained in S*^. We claim 
that there exist non-trivial maximal d-progressions A < B in S such that max(yl) -|- max(i?) < 
max(X). Indeed, set A = B to he the maximal d-progression containing {min(5'), min(S') -|- d}. 
Consequently, we can choose A < B satisfying the conditions of the claim with max(A) -|- max(S) 
maximal. 

Since X < y it follows that max[A) < max(5) which implies that there exists a maximal d- 
progression C in 5 with B < C {in particular consider the one containing {max(5) — d, max(S')}). 
Pick any such C with min(C) minimal. 

Since S contains d-progressions A and B it follows that S'^ contains the (not necessarily 
maximal) non-trivial d-progression [min(74) -|- min[B) , msiK{A) + max{B)]d. Let X' be a maximal 
progression in 5^ containing it. Similarly let Y' he a maximal d-progression in S"^ containing 
[min(i?) -|- min(C), max(S) -|- max(C)]d. By the choice of A, B, and C it follows that X' < X and 
X <Y'. As y is consecutive to X it follows that Y <Y' . We finish by proving that the distance 
from X' to y', min(i?) -|- min(C) — max.{A) + max(i?), is strictly smaller than n. Indeed, by the 
minimality of C, min(C) — max(^) < n and since B is non-trivial max(i?) — min(i?) > 0. This 
finishes the proof of Claim 2. 
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Let T be any arbitrary d-progression and let r be its size. From claim 1 and 2 it follows that 
the value of / does not increase if we replace S hy S^. Since max(5) — min(5) certainly increases it 
follows that we can assume (by replacing S by S'^ for sufficiently large m) that max(S') — min(S') > 
31 + {r — l)d. By applying iteratively claim 2 to S" = 5 n [min(S') + l~ , max(S') — wc conclude 
that S*^" contains the d-progression S"^" = [2"(min(S') + 2"'(max(S') —l~^)]d- The result follows 
by applying Lemma 35 to S"^" . ■ 

Proposition 37 Let T be a structure with only binary relations of finite degree with a first-order 
definition in {^\succ). Then either V is preserved by a modular median, or CSP{'L\ succ) is NP- 
hard. 

Proof: Assume first that F contains some relation Rg for S = {ai, . . . , Ofc}, ai < • • • < a^, that is 
not a d-progression for any d. Then there exists some i such that — ai-i ^ Oj+i — a^. Let d be the 
gcd. of all ai — aj with i,j G {1, . . . , k}. By Lemma 36, relation -R[a._^,a-]^ is pp-definable. Then 
we obtain R{aj^i,ai} with -R[a._-^_a.]^ ni?^. Similarly one obtains R{ai,ai+i}- The result follows from 
Lemma 34. Hence we are left with the case in which F contains (i-progrcssion Rs and (i'-progrcssion 
Rt with d / d'. Fix arbitrary integers a, b. In a similar way to the previous case, one can pp-define 
R{a,a+d} and R{b,b+d}- We apply again Lemma 34 to conclude the proof. ■ 

Proof: (of Theorem 31) Assume F is not preserved by a modular median. We can assume that 
there exists some d > such that every binary relation pp-definable in (F, succ) is a d-progression, 
otherwise we are done by Proposition 37. Let S be any relation in F. If 5 is 2-decomposable then 
S is invariant under m^. Consequently, there is a relation S" in F that is not 2-deconiposablc. This 
implies that, by projecting out coordinates from S, we can obtain a relation R of arity r > 3 which 
is not (r — l)-decomposable. This implies, in particular, that there exists a tuple (ai, . . . ,0^) i? 
such that for all 1 < i < r, (ai . . . , ai_i,pi, Oj+i, . . . , a^) G R for some integer pj. By replacing R 
by the pp-defined relation 



we can further assume that = for all 1 < ? < r. Furthermore, we can also assume that for 

all 1 < i < r. Pi € {—d,d}. Indeed let 1 < i < r with pi > and assume pi is minimal. Note 
that Pi is a multiple of d since otherwise for any j ^ i, the projection of R to {i,j} would not be 
a d-progression. Define Si to be the d-progression [0,Pi — d]d- Then the relation pp-defined by the 
formula 



satisfies the condition at coordinate i. If p^ < one only needs to define Si to be the \pi + d,0]d 
and proceed in the same way. 

We claim that we can pp-define a relation U of arity > 3 such that (0, . . . , 0) U and 
{(u, 0, 0, ... , 0), (0, It, 0, ... , 0), (0, 0, -u, . . . , 0)} C [/ for u = d or u = -d. 

Let P be the set of all 1 < i < r such that Pi = d and let Nhe{l,..., r}\P. If max(|P|, \N\) > 3 
then we are done as U can be obtained by permuting the coordinates of R. 

Otherwise, as r > 3 we can pick i G N and j € P. The desired relation U can be obtained by 
permuting the coordinates of the (2r — l)-ary relation defined by 

^x, y. {y = x + d) A R{xi, Xi-i,x, Xi,...,Xr) /\ R{yi, yj-i,y, yj,...,yr) A i?[o,d]^(2;, y). 




l<i<r 



3yi.Rsi{xi,yi) AR{xi,...,Xi-i,yi,Xi+i,...,Xr) ■ 
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Consider the relation V over domain {0, d} given by 

{ix,y,z) e{0,d}^ I {x,y,z,0,...,0)eU} . 

By its construction we have that (0,0,0) (and hence {d,d,d)) is not in V. Assume first that 

{(d,0,0),(0,d,0),(0,0,(i)} G U (the case {(-d, 0, 0), (0, -d, 0), (0, 0, -d)} £ U is analogous). It 
follows that V does not belong to any of the six Schaefer classes, and, hence, that the boolean 
CSP({0, d}; V) is NP-complete. We shaU show that CSP({0, d}; V)) reduces to CSP(Z; succ, U). 
Let $ be an input instance of CSP(F) with variables X and let cf) be the quantifier-free part of 
We construct an instance of CSP(Z; succ,V) in the following way: The set of variables of $ 
is X U {y} where y is a new variable not occurring in X. For every atomic formula V{x\,X2iXz) in 
(j) we include in the quantifier- free part, 0', of $' the conjunction 

A -^{0,4 (y> Xi) MJ{xi,X2,xz,y,. . . , y) 

l<i<r 

Clearly (j)' has a satisfying assignment if it has one setting y to 0. Also an assignment of the variables 
of 4>' setting y to satisfies 4>' if and only if its restriction to X satisfies (p. ■ 

Proof of Theorem 1. Suppose that T does not have a finite core. Let A be the substructure 
of r as described in Theorem 19. Clearly, CSP(r) and CSP(A) are the same problem. Unless 
CSP(r) is NP-hard, the relation succ is pp-definable in A. By the fundamental theorem of pp 
definability, the CSP of the expansion of A by the successor relation has the same complexity as 
CSP(A). Now the claim follows from Theorem 31. ■ 

7 Concluding Remarks 

Structures with a first-order definition in (Z; succ) have a transitive automorphism group, i.e., for 
every a;, y € Z there is an automorphism of P that maps x to y. We call such structures F transitive 
as well. It is well-known and easy to prove (see e.g. [15]) that a finite core of a transitive structure is 
again transitive. It follows from the main result obtained in our work that a complete classification 
of the computational complexity of finite degree distance problems would follow from a classification 
of the complexity of transitive finite CSPs. 

In general, the complexity of the CSP for finite transitive templates has not yet been classified. 
The following is known. 

Theorem 38 (of [8]) Let T be a finite core. If there is a primitive positive interpretation of the 
structure A := ({0, 1}; {(0, 0, 1), (0, 1, 0), (1, 0, 0)}) in T, then CSP{r) is NP-complete. 

The following conjecture is widely believed in the area. 

Conjecture 39 (of [8]) Let T be a finite core. If there is no primitive positive interpretation of 
the structure A := ({0, 1}; {(0, 0, 1), (0, 1, 0), (1, 0, 0)}) in F, then CSP{T) is in P. 

Wc believe that this conjecture might be easier to show for transitive finite cores only. Note 
that by transitivity, the polymorphism algebra of F has no proper subalgcbras. Since F is a core, 
all polymorphisms are surjective. It follows from known results [8] that, unless CSP(F) admits a 
primitive positive interpretation of A, all minimal factors of the polymorphism algebra contain an 
affine operation. 
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